Learning reduced kinetic Monte Carlo models of complex chemistry from molecular dynamics
نویسندگان
چکیده
We propose a novel statistical learning framework for automatically and efficiently building reduced kinetic Monte Carlo (KMC) models of large-scale elementary reaction networks from data generated by a single or few molecular dynamics simulations (MD). Existing approaches for identifying species and reactions from molecular dynamics typically use bond length and duration criteria, where bond duration is a fixed parameter motivated by an understanding of bond vibrational frequencies. In contrast, we show that for highly reactive systems, bond duration should be a model parameter that is chosen to maximize the predictive power of the resulting statistical model. We demonstrate our method on a high temperature, high pressure system of reacting liquid methane, and show that the learned KMC model is able to extrapolate more than an order of magnitude in time for key molecules. Additionally, our KMC model of elementary reactions enables us to isolate the most important set of reactions governing the behavior of key molecules found in the MD simulation. We develop a new data-driven algorithm to reduce the chemical reaction network which can be solved either as an integer program or efficiently using L1 regularization, and compare our results with simple count-based reduction. For our liquid methane system, we discover that rare reactions do not play a significant role in the system, and find that less than 7% of the approximately 2000 reactions observed from molecular dynamics are necessary to reproduce the molecular concentration over time of methane. The framework described in this work paves the way towards a genomic approach to studying complex chemical systems, where expensive MD simulation data can be reused to contribute to an increasingly large and accurate genome of elementary reactions and rates.
منابع مشابه
Gyration Radius and Energy Study at Different Temperatures for Acetylcholine Receptor Protein in Gas Phase by Monte Carlo, Molecular and Langevin Dynamics Simulations
The determination of gyration radius is a strong research for configuration of a Macromolecule. Italso reflects molecular compactness shape. In this work, to characterize the behavior of theprotein, we observe quantities such as the radius of gyration and the average energy. We studiedthe changes of these factors as a function of temperature for Acetylcholine receptor protein in gasphase with n...
متن کاملEnergy study at different solvents for potassium Channel Protein by Monte Carlo, Molecular and Langevin Dynamics Simulations
Potassium Channels allow potassium flux and are essential for the generation of electric current acrossexcitable membranes. Potassium Channels are also the targets of various intracellular controlmechanisms; such that the suboptimal regulation of channel function might be related to pathologicalconditions. Realistic studies of ion current in biologic channels present a major challenge for compu...
متن کاملKinetic Monte Carlo Study of Biodiesel Production through Transesterification of Brassica Carinata Oil
In the present study, the kinetics of biodiesel production through transesterification of Brassica carinata oil with methanol in the presence of Potassium Hydroxide is investigated by kinetic Monte Carlo simulation. The obtained results from simulation agree qualitatively with the existing experimental data. The kinetics data for each step of suggested mechanism are confirmed by simulation. By ...
متن کاملKinetic Monte Carlo Simulation of Oxalic Acid Ozonationover Lanthanum-based Perovskitesas Catalysts
Kinetic Monte Carlo simulation was applied to investigation of kinetics and mechanism of oxalic acid degradation by direct and heterogeneous catalytic ozonation. La-containing perovskites including LaFeO3, LaNiO3, LaCoO3 and LaMnO3 was studied as catalyst for oxalic acid ozonation. The reaction kinetic mechanisms of each abovementioned catalytic systems has been achieved. The rate constants val...
متن کاملInvestigation of Monte Carlo, Molecular Dynamic and Langevin dynamic simulation methods for Albumin- Methanol system and Albumin-Water system
Serum Albumin is the most aboundant protein in blood plasma. Its two major roles aremaintaining osmotic pressure and depositing and transporting compounds. In this paper,Albumin-methanol solution simulation is carried out by three techniques including MonteCarlo (MC), Molecular Dynamic (MD) and Langevin Dynamic (LD) simulations. Byinvestigating energy changes by time and temperature (between 27...
متن کامل